Eliminating Invalidation in Coherent-Cache Parallel Graph Reduction

نویسندگان

  • Andrew J. Bennett
  • Paul H. J. Kelly
چکیده

Parallel functional programs based on the graph reduction execution model display considerable locality of reference, favouring the use of large cache lines in the implementation of the shared heap on a shared-memory multiprocessor. They also display a very high rate of synchronisation, making conventional weakly-consistent coherency protocols ineeective at avoiding unnecessary contention for write access to cache lines due to false sharing. We present the design of a specially adapted cache coherency protocol and show results of simulation experiments which demonstrate that the protocol allows spatial locality to be exploited to at least the level of a conventional invalidation protocol, but without the unnecessary serialisation and network transactions caused by false sharing.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Locality and False Sharing in Coherent-Cache Parallel Graph Reduction

Parallel graph reduction is a model for parallel program execution in which shared-memory is used under a strict access regime with single assignment and blocking reads. We outline the design of an ee-cient and accurate multiprocessor simulation scheme and the results of a simulation study of the performance of a suite of benchmark programs operating under a cache coherency protocol that is rep...

متن کامل

Localtiy and False Sharing in Coherent-Cache Parallel Graph Reduction

Parallel graph reduction is a model for parallel program execution in which shared-memory is used under a strict access regime with single assignment and blocking reads. We outline the design of an ee-cient and accurate multiprocessor simulation scheme and the results of a simulation study of the performance of a suite of benchmark programs operating under a cache coherency protocol that is rep...

متن کامل

Sharing and Contention in Coherent - cache Parallel

Parallel graph reduction is a model for parallel program execution in which shared memory is used under a strict access regime with single assignment and blocking reads. We present the design of an eecient and accurate multiprocessor simulation scheme and the results of a simulation study of the pattern of access of a suite of benchmark programs.

متن کامل

To Appear in the 1996 International Conference on Parallel Processing

{ The presence of procedures and procedure calls introduces side eeects, which complicates the analysis of stale reference detection in compiler-directed cache coherence schemes 4, 6, 8]. Previous compiler algorithms use cache invalidation at procedure boundary 5, 7] or inlining 7] to avoid reference marking interprocedurally. In this paper, we introduce a full interprocedural algorithm, which ...

متن کامل

Reducing Coherence Overheads with Multi -‐ line Invalidation ( MLI ) Messages

Most multiprocessors employ coherent caches despite the overheads of doing so. As future processors will be multiprocessors with elaborate cache hierarchies, the overheads of cache coherence will be an important area for efficiency improvements. This paper proposes a novel technique, called Multi-Line Invalidation (MLI) messages, to reduce an important aspect of coherence overhead—the invalidat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994